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Introduction 


While begun with great hope and promise, the project “Utilization of Genomic Signatures to 
Direct Use of Primary Chemotherapy in Early Stage Breast Cancer” encountered problems with 
data integrity in the basic science underpinning the clinical trial. Consequently, the CDMRP 
requested early termination of the protocol.. The issues eventuating in this outcome are detailed 
in a number of sources.’ The findings of the Institute of Medicine were reported in a summary 
document on March 23, 2012.? The clinical trial was permanently closed to accrual, and further 
payments on the award cancelled. 


The clinical trial primary aim was to validation genomic predictors of chemosensitivity based on 
work conducted by Anil Potti, MD while in the laboratory of Joe Nevins, PhD in Duke’s Institute 
for Genome Sciences and Policy. The reviewers found the promise of chemotherapy predictors 
to be a strong consideration in funding the grant, particularly in the context of a prospective 
randomized clinical trial. It is, however, the scientific work underlying these predictors that was 
found to contain corruption of the original datasets used for validation. While the data 
corruption detected is the subject of an ongoing scientific misconduct investigation at Duke, the 
systematic nature of the data changes in this and other work by Dr. Potti point to data 
manipulation as the only reasonable explanation. The paper describing those signatures has been 
retracted.* Given these developments, continuing with a validation trial for these signatures 
prospectively was no longer appropriate. 


This final report will summarize the work done to institute the clinical trial, including profiling 
work done on additional samples as part of that work. A considerable genomic dataset was 
created in this process, and we hope that it will serve some public use and provide some positive 
outcome from this experience. The project did meet some of the original goals. For example, 
one goal of the original proposal was to demonstrate the conduct of a prospective preoperative 
treatment trial using an integral full-genome expression-based biomarker design with pathologic 
complete response as an objective and accepted primary endpoint. While we acknowledge the 
weaknesses in the effort highlighted by the IOM , we have demonstrated the feasibility of this 
approach. 


Body: 


Our original CTRA proposal was motivated by two key concepts: 1) designing a trial with real- 
time genomic profiling integral to assigning patient treatment; and 2) implementing this in a 
preoperative trial so that relevant clinical endpoints (pathologic complete response; clinical 
response) could be assessed. Work in the Duke cancer genomics group had focused on 
characterizing a number of biologic features using gene expression profiling: 1) prognosis”; 2) 
oncogenic pathway activation®’; and 3) chemosensitivity prediction®. Our proposal had the 
following objectives: 
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1. Develop and evaluate the clinical infrastructure and methodologies that will be central to the 
use of genomic profiling to assign patients to treatment. 


2. Develop and conduct a Phase II clinical trial that will evaluate the capacity of genomic 
signatures to improve the efficacy of chemotherapy selection for the individual patient. 


3. Make use of the data and results from the Phase II study to develop a follow-up Phase III 
clinical trial that will compare genomic-guided vs. standard treatment strategies, and will 
identify novel therapeutic opportunities for those patients likely to be resistant to standard 


cytotoxic therapies. 


Of the “signatures” developed, the chemosensitivity signatures were the most easily integrated 
into a preoperative treatment protocol that would comport with commonly accepted standards of 
clinical care. We designed and initiated a prospective Phase II trial integrating the genomic 
signatures of chemosensitivity to doxorubicin and docetaxel into a preoperative trial. 


The following describes the work and accomplishments for each of these objectives: 


1. Develop and evaluate the clinical infrastructure and methodologies that will be central to the 
use of genomic profiling to assign patients to treatment. 


1) Development (completed prior to study): 


a. Process raw gene expression data for training 


b. Select features with strongest (Pearson) 
correlation to phenotype. 


c. Summarize expression by a singular value 


decomposition (SVD) [Factors derived under 


Type 1 or Type 2 model} 


d. Build a Bayesian probit regression model to 
predict phenotype 


2) Prospective Validation: 


a. Process investigational set normalized by SVD 


or scale/shift standardization. 


b. Summarize expression values using SVD 
factors 


c. Apply model to obtain a predicted probability 


of clinical outcome 
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Figure 1: Schematic 
showing steps for 
processing expression 
data from cell line 
signatures and 
normalizing one-at-a- 
time clinical samples 
into standardization 
set for generating 
predictions. 
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We were able to develop the infrastructure [a 
to generate full genome expression data in 
a CLIA environment (compliant from study 
start, and approved subsequently), and a 
bioinformatics pipeline to analyze that data 
and use it for patient treatment 
assignments. The chemosensitivity 
signatures were based on cell-line 
expression data, so expression data from a 
comparison set of breast cancers was used 
to standardize the cell-line based signatures 
for making predictions of chemosensitivity. 
This process is outlined in Figure 1 and 
discussed further in “Intratumor B 10 

Heterogeneity and Precision of Microarray- I \ 
Based Predictors of Breast Cancer Biology 
and Clinical Outcome”. ° This paper also 
presents the analytic and precision data 
obtained for the Affymetrix U133A plus 2 
arrays and the RNA extraction process that 
was used in the course of the trial. The i 
chemosensitivity signatures presented in : 


0 02 4 08 os 110 


1.04 Doxorubicin Sensitivity 


Probability 


Probability 


that paper were the versions used in the Doxorubicin Sensitivity 
trial, and had been provided by Dr. Potti as Figure 2: Demonstration of reproducibility of multigene 
being the signatures used in the prior signatures in bioreplicates. 


validation work. Some adjustments were 

made to transfer the signatures to 

performance on the U133A plus2 arrays. The precision of these multigene signatures in the 
replicate dataset is illustrated in Figure 2 from the paper. 


2. Develop and conduct a Phase II clinical trial that will evaluate the capacity of genomic 
signatures to improve the efficacy of chemotherapy selection for the individual patient. 


We also set up a screening process for patient enrollment, detailed in Figure 3. 


The biopsies were reviewed by Dr. Joseph Geradts for quality and cellularity, and the best 
available core used for profiling. The infrastructure for extracting quality RNA was set up in the 
Clinical Trials Support Facility in collaboration with the IGSP, and was supervised by Dr. Mike 
Datto. Extracted RNA was then quantitated and then sent to Expression Analysis for 
hybridization to Affymetrix U133A plus 2 microarrays. The resulting .CEL files were uploaded 
to a secure server in the IGSP and available only to Dr. William Barry and his biostatistical team. 
The array data was analyzed by QC metrics to assure adequate signal, and then chemosensitivity 
predictions using the now discredited signatures were made. The turnaround times and overall 
success for processing samples are given in Figures 4 and 5. 
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Frozen Core Not 
Available: Research 
core obtained 
(+/- sentinel node 
mapping) 


Protocol Assigned 
Chemotherpay Initiated 


Patient Randomized 


Microarray Data 


Analyzed 


Bank cores used) Available 


Median Max 


RNA processed to Microarray data available 


(Purple) Cancer diagnosis to Initiation of treatment 


(Blue) Tissue sample is available to Microarray data available 


(Green) Tissue sample is available to Initiation of treatment 


(Red) Protocol consent to Initiation of treatment 


Additional days needed for microarray data processing 


2012 


Figure 4: Steps and turnaround times for obtaining, processing, and generating data from core biopsies. 
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and 
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+ Failed profiting: 1 


Figure 5: Metrics for sample processing success/failure and obtaining expression data 


for use in treatment assignments. 


Under 40 
40-49 
50-59 
60-69 
70-79 


White 
Black 
Hispanic 
Not Reported 


Ductal 
Lobular 
Other 


ER positive 
ER negative 
ER unknown 


Table 1: Demographic and clinical characteristics 
of patients. 


Table 1 shows the final accrual to the trial 
before termination. The distribution by 
race, tumor size, histology, and receptor 
status is consistent with what was 
anticipated. 


The trial design is given in Figure 6. The 
design is innovative in being an integral 
biomarker validation trial. In such a 
design, obtaining the biomarker (i.e. high 
quality Affymetrix array data) is required 
before randomization in the trial. The 
requirement means that for every patient 
enrolled in the trial, the required biomarker 
data will be available. For other designs, 
where the marker validation is a secondary 
endpoint, this is not the case, and such 
designs often fail to meet their objectives 
because of missing biomarker data. 
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Unbinding of Treatment Assignment 


signatures were skewed in 
favor of one regimen or 
the other. 


Figure 5: Clinical trial schema and patient assignments, with pCR cases at 
bottom. 


The primary endpoint for the study was pathologic complete response in the breast, defined as 
resolution of all invasive cancer. Nodal status was not considered since prechemotherapy 
sentinel node mapping was allowed in the trial. The treatment assignments were to four cycles 
of standard dose doxorubicin/cyclophosphamide or docetaxel/cyclophosphamide, both standard 


regimens for treatment of early 
Overall pCR stage breast cancer. Patients 
underwent treatment, and then 


9 % CT: 4-279 were Clinically reassessed. If 
Toa) additional chemotherapy was 
; ; thought necessary by the 
Genomically-guided (n=26) Control (n=13) treating physician, that was 
15% (4/26) 8% (1/13) allowed, but patients were 
P=0.64 asked to undergo an optional 


repeat biopsy to established the 
presence/absence of residual 
pCR pCR cancer at that time. While that 
design feature had the potential 
to undermine the scientific 
conclusions, it was felt 


Genomic No Yes Total Genomic No Yes Total 
Prediction Prediction 


Resistant 9 2 11 Resistant 8 1 9 necessary to avoid 
Sensitive 6 2 8 Sensitive 7 0 7 compromising patient 
Total 1 16 management. 
Total is |Z! Jie) ola 
Fisher’s exact p>0.99 Fisher’s exact p>0.99 Table 2: Primary analysis of trial 


endpoint for patients enrolled, and 
analysis by treatment regimen and prediction of response use discredited chemosensitivity signatures. 
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The results by pCR in each of the arms and by regimen are given in Table 2. The analysis by 
regimen was for actual regimen received, not intention to treat, since three of the TC patients did 
not have definitive surgery following completion of protocol directed therapy and one had an 
infusion reaction with cycle one and was treated with AC. The trial is, of course, underpowered 
to show any difference between arms or regimens. However, given the controversy regarding 
the chemosensitivity signatures, and the supposition that using the signatures to assign treatment 
might have led to treating them with the wrong drug, we undertook this analysis to see if that 
appeared to be the case. The results suggest that this is very unlikely. 


We will submit the conduct and results of this trial for publication. This will include data that 
has been presented at several meetings, including ASCO and the ASCO/EORTC/NCI 
Biomarkers Meeting in October, 2010. ‘°(See Appendix) Despite the flaws in the underlying 
science, hopefully the data will prove useful in thinking about the conduct of an integral 
biomarker-validation trial design. As well, the annotated expression data can provide a potential 
pilot validation set for biomarkers attempting to distinguish anthracycline and taxane sensitivity 
in HER2 negative disease. 


Table 3: Samples and analyses conducted. 


3. Make use of the data and results 
from the Phase II study to develop 
a follow-up Phase III clinical trial 
that will compare genomic-guided 
vs. standard treatment strategies, 
and will identify novel therapeutic 
opportunities for those patients 
likely to be resistant to standard 
cytotoxic therapies. 


Since the trial was terminated, we 
could not proceed as planned with 
this objective. Instead, we sought 
to salvage the value of the genomic 
information obtained in this 
limited dataset by extending the 
analysis beyond gene expression 
profiling on the samples in the trial 
only. The additional samples and 
analyses conducted were proposed 
and accepted in the Annual update 
for Year 4, and are summarized in 
Table 3. The following analyses 
are either completed or underway 
for the data obtained from these 
samples: 


n= number of potentiial cases; some with replicates 
Not applicadle 

of not 7 
accomplished 
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Post Treatment Profiling 

A total of 37 samples were collected from the post treatment cancers. This set is again 
unique for examining gene expression changes between doxorubicin and docetaxel 
treated patients. While this dataset will be small, that is true of most similar datasets; 
they are challenging to develop. Nevertheless, this sample set can be used for discovery 
purposes and as a potential validation set for predictors |! 


Peripheral Blood Gene Expression Profiling 

PAX gene tubes for isolation of peripheral circulating RNA were obtained at baseline and 
after protocol directed therapy on all patients. The RNA has been extracted from these 
samples, and will be profiled using Affymetrix arrays to look and changes in circulating 
RNA. These changes can be compared to changes in the primary tumor and provide a 
pilot set for development of response and toxicity prediction signatures. 


. Expansion of bioreplicate cohort 

Most of the cases enrolled in the trial had at least one additional pretreatment core taken. 
We have profiled those cores to expand the replicate dataset already published. This 
work was recently presented at the 2012 San Antonio Breast Cancer Symposium, and 
will be expanded further to look at reproducibility as a function of tumor cellularity and 
to look at reproducibility of published prognostic and predictive signatures, and is 
included in the appendix. 


Pre/Post Breast MRI Patterns and Gene Expression Profiles 

This dataset can also be analyzed for baseline breast MRI patterns and gene expression 
profiles. While some work has been done in this area, the tumors are generally larger, 
locally advanced cancers.’” This dataset has smaller tumors represented. As well, the 
data on focused regimens is unique. Prior work, including some at Duke, suggests that 
MRI patterns of enhancement can predict response. We can look at similar imaging 
patterns in this cohort of patients, and also analyze by subtype. 


Cross-platform Genomic Prognostic Signatures. 

Using the 19 patients in the study with clinical Oncotype Recurrence Scores as a starting 
set, we have assembled a cohort of 104 cases from the SPORE tumor registry with 
concurrent Oncotype scores as a comparison set. This analysis was presented in poster 
form at the 2012 San Antonio Breast Cancer Symposium and is included in the appendix. 


Proteomic Analysis Somatic Gene Mutations 

We have done proteomic profiling of pre/post AC treatment cores on 17 patients using 
2DLC-MS/MS analysis. Initial analysis of the data has been performed and indicates that 
this approach is possible, and that changes in protein expression can be detected. We are 
attempting to integrate this data with gene expression data and complete the primary 
analysis for protein expression changes that might be associated with anthracycline 
sensitivity and resistance. 
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7. Somatic Gene Mutations 
With the processing of replicate cores, we were able to extract tumor DNA from 12 pre- 
treatment cores and 6 matched post-treatment cores from the AC treated cohort of 
patients. In a separate collaboration, these samples are being analyzed with whole exome 
sequencing to look at potential changes is somatic mutations detected. As well, four of 
these cases had pathologic complete responses, so a preliminary discovery effort can be 
made to look for mutations predicting AC response. 


8. Lymph Node Predictor 
Previous work in the Duke Genomics group focused on development of a predictor of 
lymph node involvement.*’*"* The clinical applicability of the lymph node predictor was 
unclear at the time this work was performed. However, with new studies suggesting that 
completion axillary dissection is not necessary for patients with limited lymph node 
involvement, the performance characteristics needed for a lymph node predictor have 
changed. The datasets described here provide a further opportunity to validate and/or 
further develop a lymph node predictor. 


Key Research Accomplishments 


e Demonstration of the performance of the clinical trial infrastructure, including rates of 
successful profiling, quality assurance of genomic data, and time metrics for assay 
performance. 

e Randomization of 39 patients in the prospective trial, with associated biospecimens 
(including frozen tumor, germline DNA, and blood). 

e Generation of a small but valuable dataset for potential validation of published 
chemosensitivity signatures from other groups, particularly for comparing anthracycline- 
and docetaxel-based treatment in isolation. 

e Dataset of expression profiling analysis of 104 estrogen receptor positive early stage 
breast cancers annotated with clinical Oncotype Recurrence Scores. 

e Dataset of expression profiling analysis of 46 cases with between 2 and 5 bioreplicate 
cores of varying cellularity useful for studying reproducibility of gene expression based 
prognostic and predictive signatures. 

e Proteomic data on 17 matched pre/post AC treatment cores. 

e  Peripheral/blood gene expression profiles on 34 patients pre/post chemotherapy exposure. 

e To be completed with support from other funds: whole exome sequencing on pre/post AC 
treated cases. 


Reportable Outcomes 


The following work related to this project has been presented at meetings and/or published. 
Please see the documents appended at the end of this report. 
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e Poster 1: “Development and Implementation of Genomic Predictors of Chemotherapy 
Response for Guiding Preoperative Therapy in a Prospective Breast Cancer Trial”, NCI 
Translational Science Meeting, November 5-7, 2009. 

e Poster 2: “A randomized phase II trial evaluating the performance of genomic expression 
profiles to direct the use of preoperative chemotherapy for early-stage breast cancer” 
ASCO Annual Meeting, 2010. 

e Poster 3: “Utilization of Genomic Signatures for Chemotherapy Response in Prospective 
Clinical Studies”, ASCO Annual Meeting, 2010. (Poster Discussion link on ASCO Virtual 
Meeting is given in Appendix) 

e Article 1: “Intratumor Heterogeneity and Precision of Microarray-Based Predictors of 
Breast Cancer Biology and Clinical Outcome”, Journal of Clinical Oncology, 29 (13): 
2198, May 1, 2010. 

e Poster 4: “Pathologic response analysis for a genomically guided preoperative 
chemotherapy trial”, Era of Hope Meeting, 2011. 

e Poster 5: “Generation of real-time full-genome expression data for treatment assignment 
in a prospective breast cancer trial”, Era of Hope Meeting, 2011. 

e Poster 6: “A Cross-Platform Comparison of Genomic Signatures and OncotypeDx Score: 
Discovery of Potential Prognostic/Predictive Genes and Pathways”, SABCS, 2012. 

e Poster 7:” Retrospective evaluation of precision of gene-expression-based signatures of 
prognosis and tumor biology in replicate surgical biospecimen from patients with breast 
cancer”, SABCS, 2012. 


Conclusion: 


Given the collapse of the scientific rationale for the clinical trial, we have attempted to salvage as 
much meaningful data from the project as possible. We maintain that patients still received 
optimal standard of care treatment, in a study design that allowed individualization of treatment 
while also meeting the scientific goals. All of the patients were made aware of the scientific 
misconduct issues. The clinical team undertook this validation trial in good faith, believing that 
the peer-reviewed published science justified the prospective study, and that the trial design 
abided by standard of care treatment. Importantly, as outlined above, the trial also held promise 
for establishing the process for integrating a complex biomarker into clinical trial design, 
allowing for correlation of the underlying biology with other clinically relevant factors. 
Hopefully, we can still in some way honor the patients who agree to participate in this study by 
the work detailed above. 


In retrospect, the data supporting the conduct of the trial was not adequate to justify proceeding. 
This report has not attempted to address all of the issues raised in the IOM report or the 
controversy related to the scientific misconduct. Unfortunately, it is undoubtedly true that the 
ensuing scandal has damaged the field of translational genomics. While those issues went well 
beyond this project, we profoundly regret the damage done and any connection to this project. 
Duke University has investigated this matter and developed policies to ensure the integrity of 
data supporting translational clinical research.'*'® While we clearly fell short of the goal, we 
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hope the data generated will provide helpful clinic-genomic datasets to the breast cancer research 
community, and will release the data generated in a manner consistent with the Translational 
Medicine Quality Framework policies. 
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Appendices 


1. Funded personnel during conduct of the project. 
2. Paper and Abstracts cited in Reportable Outcomes. 


Funded Personnel: 

Paul Kelly Marcom MD- Overall study Principal Investigator, years 1-5 

Anil Potti MD- Scientific advisor, years 1-3. No role in trial conduct or analysis of any data in this report. 
John Olson MD, PhD- Surgical co-investigator 

Joe Nevins PhD- Scientific advisor, years 1-5 

Joseph Geradts MD- Study breast pathologist 

Sujata Ghate MD- Study breast radiologist 

Bill Barry PhD- Study biostatistician 

Jeff Marks PhD- Scientific investigator 

Mike Datto MD, PhD- Study molecular pathologist 

Bob Annechiarico- Study Bioinformatics 

Traci Foster RN, BSN- Overall project coordinator years 1-4 

Jamie Cuticchia PhD- bioinformatics 

Geoff Ginsburg MD, PhD- IGSP investigator and Director of Genomic Trials 
Tim Veldman- Project manager for |GSP 

Ryan Griffis- Molecular lab technician 

Jvonne Hunter- Molecular lab technician 

Kirk Gray- Cancer Center bioinformatics and Oracle case report form developer 
Holly Dressman- IGSP microarray facility director 

Caroline Lentz- Clinical protocol nurse 

Vlayka Liotcheva- Clinical Research Coordinator, years 2—3, project coordinator years 4-5 
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ABSTRACT 


METHODS: 


Background: Genomic assays have been shown to provide 
the potential for personalized approaches to breast cancer 
therapy. While assays based on fixed tissues offer greater 
convenience, the use of fresh tissue samples provides a 
much broader opportunity to interrogate the critical 
underlying biology. Full-transcriptome assays using 
microarrays are challenging, but have the advantage of 
providing multiple prognostic and predictive signatures in 
one assay. Using an Affymetrix platform, we have 
developed genomic predictive signatures for various 
clinically relevant cancer characteristics. Previous 
credentialing studies have demonstrated the sensitivity 
and specificity of these signatures in retrospective studies. 
We report here on the investigation of the analytic 
performance of these signatures in biologic replicates, and 
their application in a prospective randomized clinical trial 
designed to validate predictors of chemosensitivity. 
Methods: To investigate the impact of intratumoral 
heterogeneity on the genomic signatures, patients with 
multiple frozen cores were identified in the Duke Breast 
SPORE tissue repository. Cores were assessed for percent 
invasive cancer cellularity, primary tumor size, and 
standard biomarker assessments. RNA was hybridized to 
Affymetrix H133 Plus 2.0 microarrays and the data was 
used to evaluate gene expression signatures generated 
according to previously identified predictors of sensitivity 
to adriamycin and docetaxel (Potti et al, Nat Med, 2006). 
Genomic predictors of hormone receptor status were 
applied to post- processed array data, and compared to 
single-patient measures from IHC/FISH. The resulting 
analyses support the currently enrolling trial, "Performance 
of Genomic Expression Profiles to Direct the Use of 
Preoperative Chemotherapy for Early Stage Breast Cancer" 
a prospective randomized trial designed to validate 
genomic signatures for predicting response to doxorubicin 
or docetaxel treatment in HER2 negative cancers. 


Results: Fifty-one individual samples from 18 patients were 
profiled to investigate the contribution of intratumoral 
heterogeneity to signature variation. The interclass 
correlation for doxorubicin and docetaxel sensitivity 
predictor in the replicate samples was 0.72 and 0.64 
respectively (p < 0.0001). Expression data was also 
analyzed for a novel predictor of ER pathway activation. 
Predicted ER status among replicates showed perfect 
concordance using this pathway signature. The 
infrastructure has been established for conducting 
microarray analysis in support of the clinical trial, and 
provides microarray data with an approximately 5 day 
turnaround from biopsy to data availability. Multi-center 
accrual to the study is underway. Conclusions: Full 
genome microarray expression profiles are robust and 
reproducible, and can be practically obtained and applied 
in the context of a prospective clinical trial. This single 
assay can provide data that can be analyzed for a variety of 
clinically useful prognostic and predictive signatures. 


In this prospective trial, genomic assessment is performed on 
tumor samples in real-time using Affymetrix U133Plus2.0 
microarrays. Previously described “signatures” are then used in 
conduct of the clinical trial. Figures one and two outline the 
procedures for tissue acquisition and the design of the clinical 
protocol, respectively. The results describe the metrics for 
performing this assay in the context of this trial. 


Figure 1: Patient eligibility and 
tissue acquisition 
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Figure 2: A Randomized Phase II Trial Evaluating 
Performance of Genomic Expression Profiles to 
Direct the Use of Preoperative Chemotherapy for 

Early Stage Breast Cancer 
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Figure 3: General steps in prospective 
classification of microarray samples 
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RESULTS: 


Figure 4: Signature Concordance on 
Bioreplicate Cancer Samples 
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Table 1: Sample collection and 
microarray profiling success 


REASON N % “Patients 1004 and 1018 declined second biopsies and withdrew consert 
Screening Failures *Patient 1023 had a low Ongatyne score end withdrew consent 

Histology O/C Failure 1 3 to be treated with chemotherapy. 

RNA Q/C Failure 1 3 ; ae: ; 

Array O/C Failure 0 0 “Patient 1007 had no invasive tumor in the research cores and 

y : . wes classified as a Histology QUC failure. 

Technical or Processing Error 1 3 

Not Eligible 0 0 “Patient 1013 had multi-centric tumor, but did not have enough RNA 

Withdrew Consent 3 10 for genetic testing. They were classified as an RNA GUC failure. 
Total Screening Failures 6 ERE) “Patient 1015 was a RNA hybridization failure. They were classified 
Randomized 23 79 as a technical or processing error. (See Figure X for QC plot of principal 
Total 79 100 components for this analysis) 


88% successful profiling rate 


(95% Cl: 70% - 98%) 


Figure 5: Microarray quality control 
(QC) plot for sample 1015 identified 
hybridization failure 


B. Plots: PCAs (shown here), 
probe-level intensities, etc. 


A. Metrics 


Metrics described in Qwaar, Barry et al, Clinical Cancer Research, 2008 (Ref. 3) 
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Figure 6: Timeline for protocol steps 
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Figure 7: Individual patient timelines and 
impact of profiling on treatment initiation 
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(Mean= 2.75 days) 


CONCLUSIONS: 


1. True biologic replicates demonstrate substantial concordance for 
complex multi-gene expression signatures. 

2. Real-time, one-at-a-time, full genome microarray expression 
analysis using fresh frozen tumor samples and an Affymetrix 
platform as a clinical assay is feasible. 

3. Profiling success rates and turn-around times are acceptable for 
using this approach for making genomically guided treatment 
assignments in a preoperative therapy breast cancer trial. 

4.In addition to the chemosensitivity predictions being evaluated in 
the current trial, the same full genome expression data can be 
analyzed for other clinically relevant prognostic and predictive 
factors. 
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Background: Genomic assays have been shown to 
provide the potential for personalized approaches to 
breast cancer therapy. Full-transcriptome assays 
using microarrays are challenging, but have the 
advantage of providing multiple prognostic and 
predictive signatures in one assay. We have 
developed genomic predictive signatures for various 
clinically relevant cancer characteristics, including 
sensitivity to cytotoxic chemotherapies. We have 
investigated the analytic performance of these 
‘signatures in biologic replicates, and begun 
applying them in this prospective randomized 
clinical trial. Methods: Eligible patients have intact 
HERZ negative primary breast cancers with tumor 
‘size over 1.5 cm and any clinical lymph node status. 
A fresh frozen sample of the primary tumor must be 
‘obtained. After pathology assessment, RNA is 
analyzed on an Affymetrix H133 Plus 2.0 microarray. 
‘The expression data is used to predict doxorubicin 
(A) and docetaxel (T) sensitivity using previously 
defined signatures and analytic techniques 
developed to make one-at-a-time predictions from 
microarray data. Patients are then assigned to four 
cycles of standard doxorubicinicyclophosphamide 
(AC) or docetaxelicyclophosphamide (TC) 
‘chemotherapy in a blinded process randomizing in a 
1:2 ratio to a control arm (random assignment to 
either AC or TC) or a genomically guided (GG) 
‘chemotherapy arm. In the GG arm, patients are 
assigned to AC or TC treatment if the predicted 
‘sensitivity is over 0.6 for A or T, respectively. If 
predictions for both agents are over 0.6, the higher 
‘score is used; if both scores are below 0.6, patients 
are randomly assigned to AC or TC. The primary 
‘endpoint is pathologic complete response (pCR) in 
the breast, with the primary objectives: 1) determine 
whether genomic profiling for drug-sensitivity can 
improve the pCR rate as compared to randomly 
assigned therapy; and 2) determine whether 
genomic profiling can identify drug-sensitive and 
drug-resistant patients including a comparison of 
‘subgroups for the two individual regimens. The 
accrual goal is 270 patients randomized; currently, 
‘42 patients have been consented and 32 
randomized. (Current accrual updated in poster) 


‘Genomic signatures can provide an array of predictive biomarkers to 
guide selection of therapy. Using in vitro approaches can potentially 
accelerate this discovery process and expand the array of predictive 
tools. We have described in vitro cell culture experiments used to 
develop predictive genomic signatures, including commonly used 
cytotoxic chemotherapeutics (Refs 1-3). Figure One outlines the 
approach for docetaxel (T) and adriamycin (A). 


Figure 1: in vitro modeling 


Expression data from the NCL60 call ines 
having the greatest resistance and 
Sensitivity to docetaxel were used to 
Generate predictive signatures, which were 
thon validated in a set of Independent cell H 
lines. The same approach was used for q 
adriamycin. 


Figure 2: Retrospective Clinical Validation 


‘These signatures were then applied to a number of publically 
available clinical datasets with available genomic expression data to 
assess their performance for predicting sensitivity and resistance. 


ma 


In a preoperative breast cancer trial by Chang et al, (Ref. 4) pationts treatod 
with single agent docetaxel were assessed for clinical response, and genomic 
profiling data obtained on the cancers. The In vitro signature demonstrated 
{good accuracy for predicting sensitivelresistant tumors (A). In a second 
preoperative tral, Bonnefoi ot al genomically profiled cancers treated with 
standard Fluorouracil/Epirubicin/Gyelophosphamide (FEC) or 
Eplrublein/Taxotere followed by Taxotere (TET). A combination of in vitro 
dorived signatures predicted clinical response (B),(Ref.8) 


Figure 3: Microarray Signature Precision 


To develop the use of microarray data as a clinical assay that 
allows simultaneous assessment of multiple predictive signatures, 
including those for A and T, we profiled replicate samples 

from the Duke Breast SPORE, performed 

hierarchical clustering (A) and principal 

component analysis (B) to assess con- = 

cordance of genomic profiling. The 

signatures for A and T were also assess- 
ed (C) and showed intraclass correlation , 
coefficients (ICC) of 0.72 and 0.64, 
respectively. These data provided 
evidence of reproducible precision for 
these and other genomic signatures. 

(Ref. 7) Full gene lists for the signatures . 
are provided in this reference. 


Figure 4: Prospective Use of Gene Signatures 


Akey aspect of employing 
in vitro derived gene signatures 
prospectively requires normal- 
lization of the cell line training 
set data into a standardization 
set of tumor expression data. 
This allows one-at-a-time 
prediction using the genomic 
signature of interest. The 
statistical methods developed 
for this approach are described 
further in abstract #10513 from this meeting. 
(Ref. 8). Similar approaches have been shown 
to work by others (Ref. 9). 


Based on this work, we are conducting a prospective random-ized 
preoperative clinical trial assessing the use of the A and T signatures 
to choose between two standard of care regimens. In this design, the 
biomarker is integral to the trial design. The eligibility criteria, process 
for obtaining tissue, and clinical schema are shown below. The 
method of treatment assignment to AC or TC is double blinded. 


Figure 5: Patient eligibility and tissue acquisition 
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Primary Objectives and Statistical Design: 


41. To determine in early stage breast cancor treated with PST whether 
‘genomic profiling for drug-sensitivity can improve the pCR rate as 
compared to random assignment of patients to therapy. 


2. To determine in early stage breast cancer treated with PST whether 
{genomic profiling can Identify drug-sensitive and drug-resistant 
patients including a comparison of subgroups for the two individual 
ragimens (le. AC and TC). 


Primary objective 1 is to test fora difference between arms in PCR rate using ao 
Sided Type 1 error rate of 0.05. The null hypothesis is thatthe POR rate in both arms 
[ 0.13. Given that the tue pCR rates are 0:13 versus 0.26, the chi-square test of 
proportions has 82% power of rejecting the null hypothesis. Given that the true PCR 
‘ates are 0.13 versus 0.24, the chi-square test has 72% power. Even if the null 
hypothesis is not rejected, Primary Objective 2 will have reasonable power to 
{examine the difference in pCR rates between predicted drug sensitive and predicted 
Grug resistant forthe indvidual regimens. An interim analysis fs scheduled at 90, 
patients to assess for fut. 


+ 50 patients have been consented at DUMC 


+ 38 patients have been randomized 
+ 4 additional clinical sites have recently been 


approved for enrollment. 
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Abstract 


Background: The development of genomic signatures as predictive 
biomarkers of chemotherapeutics provides the potential to guide 
therapeutic choice in instances where standard-of-care includes multiple 
options. A critical step is their application in clinical studies designed to 
evaluate the capacity to improve selection of drugs for the individual 
patient. However, the prospective use of microarray technologies and 
multi-gene models to a single patient sample in the context of a clinical 
trial requires several methodological advancements. 


Methods: We describe a strategy that makes use of a trial-specific 
reference tumor dataset to provide a source of information for 
normalization of microarray data between the pre-clinical training set (cell 
lines) and the single patient sample. Validations of chemosensitivity 
signatures are repeated under this strategy using historical datasets with 
split-sample and bootstrap approaches. 


Results: Genomic signatures for adriamycin, docetaxel and cisplatin 
are mapped to Affymetrix GeneChip© HG-U133Plus2.0 platform. 
Prospective predictions of adriamycin sensitivity were associated with 
response to neoadjuvant treatment of 133 breast cancers (AUC=0.76, p 
< 0.001). Prospective predictions of docetaxel (n=14, AUC = 0.92, p = 
0.007) and cisplatin sensitivity (n=49, AUC = 0.97, p = 0.0001) 
discriminated between responders and non-responders under first-line 
treatment of advanced ovarian cancer. Bootstrap analyses show the 
genomic signatures require that reference sets have similar 
characteristics to the investigations sample and be of sufficient size to 
capture variation in gene expression in the patient population. 


Conclusions: These results demonstrate the accuracy of the genomic 
signatures are comparable to retrospective assessments when properly 
applied to a single sample in the prospective setting. We further describe 
trial designs to test the efficacy of these signatures and an automated 
data analysis infrastructure to enable their use in a clinical context. 


Figure 2: Strategies to Predict 
Response to Chemotherapy with 
Cancer Cell-line Data (e.g. Docetaxel) 


Figure 3: Prospective Approaches to Gene Signatures for Clinical Applications 
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Figure 4: Validation of the Prospective Approaches for Gene Signatures 


Figure 1: Gene Signatures 
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Predictors are derived from sensitive and resistant cell lines defined in 
from Gyorrfy et al and Potti A et al. Genes were selected to be most highly 
correlated (Pearson) to the cell-line phenotypes. Expression values of each 
signature are displayed as heatmaps images. 


Evaluating the Gene Signatures in 
Prospective Clinical Studies 


Docetaxel - Ovarian Cancer Dataset (N=14) Cisplatin - Ovarian Cancer Dataset (N=43, split) 
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The performance of internal reference sets was evaluated using (A) N = 110 
samples from patients that did not receive the investigational agent — 
Docetaxel; (B) 5-fold cross validation into reference (N = 106/107) and a test 
(N = 27/26) sets — Adriamycin; and (C) repeated splitting of samples into 
mo. aes < aye Se SING reference (N=20) and test (N=23) cohorts — Cisplatin. For Cisplatin, the left 
panel illustrates one iteration; right panel gives the range of results across 
100 resamples per subject (average Sens = 0.87, Spec = 0.91). . 
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A critical aspect of evaluating the performance of the gene signatures is 
to define the conditions and procedures necessary for implementation in 
day-to-day clinical practice. The need for data normalization, as detailed 
in Figure 2, presents a significant challenge, considering the fact that in 
the context of a trial, one patient sample must be analyzed at a time. 


We propose two different strategies illustrated in Figure 3 that involve 
three sources of information- 1) the training set data that derives from the 
cancer cell lines identified as sensitive and resistant to the drug, 2) a 
reference dataset that provides the tumor specific gene expression 
context, and 3) the test sample. 


As one approach (left panel of Figure 3; Type 1 setting), the training and 
reference datasets are merged before performing a SVD. The principal 
components from the merged data set are applied to the Bayesian model 
without including phenotype information for the reference samples, and 
the Bayesian model is then applied to post-processed expression data 
from the test sample. 


In an alternative approach (right panel of Figure 3; Type 2 setting), the 
reference set is used to directly standardize the expression data from 
each incoming tumor sample to the training set, while determining the 
principal components and developing the model solely within the training 
set data. 


Figure 5: Impact of a Reference Set on Predicting Response (e.g. Adriamycin) 


Figure 6: Detailed Work-Flow for 
Prospective Use of Gene Signatures 
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Predictive models of chemosensitivity 
were derived using an_ established 
method of data decomposition and binary 
regression models, as described in West 
2001. As the development phase, 
training data alone are used to select the 
features of the model as the probe sets 
having the strongest (Pearson) 
correlation to the tabulated phenotype. 
Expression values were summarized by 
the top two principal components from a 
singular value decomposition under the 
sets of samples defined by a Type 1 and 
Type 2 analysis (see Figures 2 and 3). 


probability is taken as the average value from the posterior distribution of the model. 
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Summarized expression values (termed 
‘metagenes’ in previous publications) are applied to a Bayesian probit regression model 
with non-informative priors for the parameters pertaining to the linear model and variance 
terms. A Markov Chain Monte Carlo (MCMC) is used to obtain the posterior distribution for 
the linear predictor and regularized probabilities from the training dataset, using 1000 burn- 
in’s and 5000 iterations to ensure convergence. For the validation phase investigational 
samples are normalized by the SVD or scale/shift standardizatrion, the expected predicted 


GSE2034 dataset has similar global expression levels to the neoadjuvant breast cancer 
dataset. A set of 50 breast cancer samples obtained from Duke show a different global 
profile likely due to a difference in the Affymetrix platform (U133 plus 2.0 versus U133A). 
Therefore the results from applying the adriamycin signature are more discordant with the 
retrospective and 5-fold cross-validated results (Figure 4). 
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Figure 7: A Quantitative Approache 
to Evaluating a Reference Set 
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Figure 8: Schema for a Randomized 
Clinical Trial Design to Evaluate the 
Performance of Multiple Genomic 
Signatures in Directing Treatment 
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Statistical analysis plans for this trial design can provide preliminary 
evidence of the performance of genomic-guided therapy (aka 
randomized phase II trial), or definitive evidence of superiority to 
random selection (aka phase III trial). 
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Purpose 

Identifying sources of variation in expression microarray data and the effect of variance in gene 
expression measurements on complex predictive and diagnostic models is essential when 
translating microarray-based experimental approaches into clinical assays. The technical reproduc- 
ibility of microarray platforms is well established. Here, we investigate the additional impact of 
intratumor heterogeneity, a largely unstudied component of variance, on the performance of 
several microarray-based assays in breast cancer. 


Patients and Methods 

Genome-wide expression profiling was performed on 50 core needle biopsies from 18 breast cancer 
patients using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Global profiles of 
expression were characterized using unsupervised clustering methods and variance components 
models. Array-based measures of estrogen receptor (ER) and progesterone receptor (PR) status were 
compared with immunohistochemistry. The precision of genomic predictors of ER pathway status, 
recurrence risk, and sensitivity to chemotherapeutics was evaluated by interclass correlation. 


Results 
Global patterns of gene expression demonstrated that intratumor variation was substantially less than 
the total variation observed across the patient population. Nevertheless, a fraction of genes exhibited 
significant intratumor heterogeneity in expression. A high degree of reproducibility was observed in 
single-gene predictors of ER (intraclass correlation coefficient [ICC] = 0.94) and PR expression (ICC = 
0.90), and in a multigene predictor of ER pathway activation (ICC = 0.98) with high concordance with 
immunohistochemistry. Substantial agreement was also observed for multigene signatures of cancer 
recurrence (ICC = 0.71) and chemotherapeutic sensitivity (ICC = 0.72 and 0.64). 


Conclusion 

Intratumor heterogeneity, although present at the level of individual gene expression, does not 
preclude precise microarray-based predictions of tumor behavior or clinical outcome in breast 
cancer patients. 


J Clin Oncol 28:2198-2206. © 2010 by American Society of Clinical Oncology 


based assays to guide clinical decisions, translating 


INTRODUCTION 


Since its inception, microarray technology has pro- 
vided a powerful tool to the research community 
because of its ability to simultaneously measure the 
expression of tens of thousands of genes. In particu- 
lar, breast cancer research has seen great benefits 
from this technology, with many studies describing 
multigene expression patterns associated with di- 
agnostic and prognostic subclasses among other- 
wise indistinguishable tumors.'* These studies have 
also established the ability to predict a cancer pa- 
tient’s treatment response based on gene expression 
patterns.*”” With the clear potential of microarray- 
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these assays to the clinical laboratory is imperative. 
Clinical translation requires an understanding 
of factors that influence the precision and accuracy 
of microarray-based assays. Chief among these fac- 
tors is the variability of gene expression measure- 
ments, which can be divided into technical (intrinsic 
to the platform) and preanalytic (intrinsic to the sam- 
ple) sources of variation. The Affymetrix GeneChip 
Human Genome U133 Plus 2.0 array platform (Af- 
fymetrix, Santa Clara, CA), investigated herein, 
has a high degree of reproducibility and thus little 
technical variance, as established by several groups 
including the MicroArray Quality Control (MAQC) 
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project.*” Variance intrinsic to a sample is more difficult to control, 
particularly for solid tumor specimens where intratumor hetero- 
geneity could result in significant sampling bias. Small sampling, 
such as needle core biopsies, can yield samples from the same tumor 
with different histologic and biologic features. The effect of tumor 
heterogeneity on microarray-based assays has been evaluated in 
some cancers, although breast cancer is surprisingly understudied 
in this regard.'*"'7 

Here, we investigate the impact of tumor heterogeneity on sev- 
eral microarray-based predictors of biologic behavior and clinical 
outcome in breast cancer patients. Multiple core biopsies from indi- 
vidual patients were evaluated by routine histology and tested using 
single-gene measurements and multigene signatures that would be 
integral to the routine care of the breast cancer patient, including 
estrogen receptor (ER) status, progesterone receptor (PR) status, risk 
of cancer recurrence, and chemotherapeutic sensitivity. Precision for 
each of these predictors was measured and evaluated in the context of 
performance expectations for clinical assays. 


PATIENTS AND METHOD 


Tumor Sample Collection and Histologic Analysis 

Following patient consent, samples were obtained from breast cancer 
excisions as part of a Duke University Health System (DUHS) institutional 
review board-approved tissue banking and research protocol for the Duke 
University Medical Center (DUMC) Breast Cancer Specialized Program of 
Research Excellence (SPORE). Immediately after surgical excision, lumpec- 
tomy specimens were sampled by 14-gauge needle core biopsy using an imag- 
ing device, as previously described.'* The core biopsies were embedded in 
Tissue-Tek OCT (Qiagen, Valencia, CA) and frozen in liquid nitrogen. One 
5-j.m frozen section was prepared from each sample, stained with hematoxy- 
lin and eosin, and evaluated by an expert breast pathologist (J.G.). Routine 
pathologic evaluation of the corresponding clinical specimens by the DUHS 
laboratories included determination of ER and PR status by immunohisto- 
chemistry (IHC). Samples with an Allred score of 0 or 2 were classified as 
negative. Human epidermal growth factor receptor 2 (HER2) status was de- 
termined by IHC and fluorescent in situ hybridization. This study included 
only HER2-negative patients with at least two frozen core biopsies containing 
neoplastic cells. 


RNA Purification and Microarray Hybridization 

Total RNA was extracted using a kit-based method (RNeasy, Qiagen, 
Valencia, CA). RNA quality was assessed using an Agilent bioanalyzer (Agilent 
Technologies, Santa Clara, CA). Hybridization targets were prepared from 2 
mg of total RNA and hybridized according to standard Affymetrix protocols 
using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Arrays 
were scanned on the Affymetrix GeneChip scanner and probe set expression 
values, percent present, and 3'/5’ probe set ratios for actin and glyceraldehyde- 
3-phosphate dehydrogenase were calculated using the Affymetrix Microarray 
Analysis Suite v5.0. 


Microarray Preprocessing and Statistical Analysis 

Expression estimates for the 50 DUMC tumor biopsies and for the 
publicly available breast cancer microarray data sets (GEO# GSE3494"° and 
GSE1456°°) were obtained by robust multiarray average”’ and log, trans- 
formed. The ratio of intratumor variance to total variance among the DUMC. 
biopsy specimens was calculated for all probe sets. The total variance was 
calculated as the sum of squared differences from the mean expression for all 
samples. Intratumor variance was calculated as the sum of squared differences 
from the mean expression within individual tumors.’* Global patterns of 
expression were evaluated by principal component analysis (PCA) and hier- 
archical clustering using average linkage of the Pearson correlation coefficient. 
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Predictors of ER pathway activation and breast cancer recurrence 
were generated from published breast cancer microarray data sets (GEO# 
GSE3494'° and GSE1456,”° respectively), using established methodologies.*”** 
Briefly, tests of differential expression were used to select gene sets strongly 
correlated to phenotype. Expression values were summarized by the top prin- 
cipal components and fitted to a Bayesian probit regression model. Predicted 
probabilities were generated for these predictors, and for previously identified 
signatures of sensitivity to chemotherapeutic agents doxorubicin and do- 
cetaxel.”* Binary classifications were made using thresholds defined a priori for 
each signature. 

The precision of all signatures was evaluated using fixed-effects analysis 
of variance (ANOVA) models and the intraclass correlation coefficient (ICC 
{1,1]).?* ICC values ranging from 0 to 1 have been characterized by Landis and 
Koch”? as indicating moderate (0.41-0.60), substantial (0.61-0.80), and almost 
perfect agreement (0.81-1.00). Accuracy of the single-gene ER and PR predic- 
tors and the multigene ER pathway predictor are reported with 95% Cls. The 
influence of clinical and technical covariates on precision and accuracy are 
assessed using analysis of covariance (ANCOVA) models. All microarray pre- 
processing and analysis were performed in R/Bioconductor and Matlab (The 
Mathworks, Natick, MA) with graphics generated using Graphpad Prism 
(GraphPad Software, La Jolla, CA) and Cluster/Treeview software (Eisen 
Lab, Berkeley, CA).?° 


RESUL 


Characterization of Morphologic Heterogeneity in 
Discrete Samplings of Individual Breast Tumors 

Fifty samples were obtained from 18 patients as part of a tissue 
banking and research protocol for the DUMC Breast Cancer SPORE. 
One patient had four replicate samples, 12 patients had triplicate 
samples, and five patients had duplicate samples (Table 1). Analysis 
was performed on frozen sections stained with hematoxylin and eosin 
for each of these core biopsies. The sample set contained a mixture of 
ER- and PR-positive and ER- and PR-negative cases (11 ER-positive/ 
PR-positive, two ER-positive/PR-negative, one ER-negative/PR- 
positive, four ER-negative/PR-negative) using the diagnostic core 
biopsy IHC as the standard. Invasive carcinoma was present in 49 
samples, while one biopsy contained ductal carcinoma in situ only. 
One set of biopsies (patient B) contained lobular carcinoma; the 
remaining cases were ductal-type carcinomas. Invasive tumor cellu- 
larity varied from 10% to 90%. RNA and microarray quality control 
metrics are provided in Table 1. 


Intertumor Variance Exceeds Intratumor Variance at 
the Level of Gene Expression 

To evaluate the performance of individual features on the Af- 
fymetrix array, we calculated the ratio between intratumor variance 
and total variance for all probe sets. As shown in Figure 1A to 1C, the 
majority of samples had an intratumor/total variance ratio below 0.5. 
A distinct inverse relationship exists between expression intensity and 
the proportion of intratumor variance. Specifically, within the top 
quartile of probe sets of highly expressed genes (Fig 1A) more than 
90% had an intratumor variance ratio less than one third of the total 
variance. Conversely, only 11% of genes expressed at comparatively 
low levels (bottom quartile, Fig 1A) had an intratumor/total variance 
ratio less than one third. We also found that probe sets corresponding 
to the U133B platform generally had higher intratumor variability 
when compared with those on the original U133A array (P < .001; Fig 
1B), and a similar pattern was noted for probe sets annotated to 
known genes compared with unannotated probe sets (P < .001). 
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Table 1, Histology, RNA, and Microarray Quality Control Measures 
Histology QC RNA QC Microarray QC 

Patient CA Concentration 

Sample Histology Size Grade ER PR HER2 (%) Other Features (ug/uL) 260/280 260/230 %P Actin GAPDH 
Al Ductal 23 3 + + +— 90 0.339 208 203 642 149 1.44 
A2 80 0.097 209 237 497 209 2.09 
AB 65 0.162 2.09 2.08 639 2.04 1.77 
B1 —_Lobular 3.6 2 + + - 60 0.068 2.02 224 534 134 1.26 
B2 85 0.08 209 244 506 121 1.12 
B3 70 0.068 2.11 186 53 181 1.36 
C1 Ductal 2 2 + + = 10 Biopsy site 0.042 1.98 239 543 149° «1.14 
c2 80 Biopsy site 0.093 206 163 56 137 1.01 
3 40 Biopsy site 0.07 204 084 57 124 1 
D1 Ductal 2.8 2 + + - 30 0.158 208 227 553 1.54 1.31 
D2 50 0.147 208 169 504 135 1.28 
D3 70 0.182 24 222 62 121 1.23 
E1 Ductal 24 3 - - = 2 0.269 208 182 527 117 1.03 
E2 75 0.324 2.08 2.27 509 1.14 1.05 
E3 50 0.26 208 22 535 1.08 1.09 
F1 Ductal/micropapillary Mult-focal 2 + -— - 26 0.06 2.11 18 641 189 1.17 
F2 60 0.292 206 219 54 119 1.13 
F3 25 0.079 204 213 542 118 099 
G1 Ductal 34 3 = ene e-7 0 0.515 208 219 513 331 1.31 
G2 30 0.076 2.03 239 561 113 0.98 
G3 70 0.331 207 2.24 544 1.21 1.04 
H1 Ductal 1.6 3 - - - 30 0.112 2.11 1.69 503 2.94 1.12 
H2 30 0.096 1.92 092 54.1 2.21 1.09 
H3 30 0.072 212 1.29 497 149 1.09 
Ha 20 0.285 2.09 1.98 496 666 1.95 
I) Ductallobular >2 2 + + = 70 0.237 2.11 1.96 525 235 1.19 
12 70 0.331 2.09 203 51 217 1.15 
13 70 0.173 212 207 515 179 14 
Jt Ductal 3.6 3 + + - 80 0.682 208 193 498 385 1.47 
J2 80 0.941 208 211 513 291 1.42 
J3 80 0.561 207 217 508 349 1.57 
K1 Ductal 24 2 + + = 15 DCIS 0.068 212 1.66 514 188 1.22 
K2 75 DCIS 0.044 2.06 185 55 1.85 1.04 
L1 Ductal 13 3 + + - oO DCS 0.055 24 131 51.2 185 1.17 
L2 15 DCIS 0.083 2.11 1.84 50.7 139 0.92 
L3 15 DCIS 0.099 2.09 1.21 51 169 1.06 
M1 Ductal 19 2 + + = 50 0.116 24 174 50.3 249 1.75 
M2 50 Necrosis 0.09 208 1.74 491 158 14 
M3 30 0.084 212 165 45.5 2.06 1.35 
Ni Ductal 18 2 + + - 20 0.077 214 18 505 167 1.19 
N2 70 0.068 206 1.96 529 128 1.09 
NB 70 0.069 213 067 516 1.74 1.33 
01 Ductal 3} 1 + + = 20° Carcinoma + papilloma 0.261 211 22 53.7 2 0.95 
02 50 Carcinoma + papilloma 0.143 213 1.24 «655 206 1.13 
P1 Ductal 0.9 3 - - - 6 0.05 211 189 544 134 1.06 
P2 10 0.044 215 161 566 134 1.05 
Q1 Ductal 18 370m Biopsy site 0.474 207-203 619 213 1.08 
a2 30 Biopsy site 0.226 211 2.03 56.1 2.02 1.02 
R1 Ductal 28 3. + -— - 70 Biopsy site 0.156 207-21 «628 166 1.12 
R2 20 Biopsy site 0.132 24 214 528 233 1.26 

NOTE. 260/280 and 260/230 are optical density ratios; CA is the tumor content in percent; % P is the percent of probe sets present; actin and 

glyceraldehyde-3-phosphate dehydrogenase (GAPDH) are the 3’ to 5’ probe set ratios for the actin and GAPDH probe sets, respectively. 

Abbreviations: QC, quality control; ER, estrogen receptor; PR, progesterone receptor; HER2, human epidermal growth factor receptor 2; DCIS, ductal 

carcinoma-in-situ. 


To evaluate the global patterns of expression, we performed PCA 
and hierarchical clustering on robust multiarray average—normalized 
data. Hierarchical clustering of all samples using probe sets with log, 
expression values > 5 demonstrated that replicate samples from a 
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single tumor tended to group together in a robust and statistically 
significant fashion (Fig 2A and Data Supplement, online only). One 
tumor showed imperfect clustering (patient F). There were no histol- 
ogy, RNA, or array quality metrics that could account for discordance 
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Fig 1. For Affymetrix probe sets, intratumor variance is generally a small 
component of total variance. Histograms stratified by (A) expression level 
demonstrate that profiles differ sharply between highly and lowly expressed 
genes, and (B) source demonstrate that U133A probe sets are generally more 
reproducible. (C) Scatter plot demonstrates an inverse relationship between 
variance ratios and average expression. 


between samples F1, F2, and F3. A PCA of the filtered expression data 
(Fig 2B) demonstrates that patient replicates largely cluster together in 
the top two principal components (capturing 28% of the total vari- 
ance in global expression), indicating that more heterogeneity is seen 
across patients than within replicates. In summary, while most genes 
exhibit a low degree of intratumor variability and most replicate sam- 
ples demonstrate similar global expression patterns, poor performing 
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probe sets exist and could potentially have an impact on the precision 
of array-based predictors of tumor biology or behavior. 


Intratumor Variance in Gene Expression Does Not 
Preclude Precise Predictions of Tumor Biology 

ER and PR status are two critical characteristics of breast cancer 
that define biologically distinct subgroups of disease. ER status and PR 
status are prognostic of clinical outcome and often determine the 
course of treatment. To determine the effect of intratumor variance on 
array-based assessments of hormone receptor status, we began by 
evaluating single-gene predictors of ER (probe set 205225_at)*” and 
PR expression (probe set 208305_at). Expression levels of 205225_at 
demonstrated almost perfect agreement among the replicates samples 
(ICC, 0.94; 95% CI, 0.86 to 0.97; Fig 3A). Further, using an optimal 
binary classification (threshold of log, = 9.6 to maximize accuracy), 
only one patient showed discordance with IHC (sensitivity, 1.0; 95% 
CI, 0.90 to 1.0; specificity, 0.88; 95% CI, 0.62 to 0.98). A single probe 
set for the PR gene, 208305_at, also showed near perfect agreement 
among replicates (ICC, 0.95; 95% CI, 0.89 to 0.98), but with an 
optimal binary classifier (threshold of log, = 5.2 chosen to maximize 
accuracy), greater disagreement was noted between PR expression and 
THC results. While all samples from the six PR-negative patients were 
classified correctly (specificity, 1.0; 95% CI, 0.80 to 1.0), samples from 
five of the 12 PR-positive patients were discordant with IHC (sensitiv- 
ity, 0.64; 95% CI, 0.45 to 0.80). 

We next created a multigene predictor of ER pathway activation 
froma previously published breast cancer Affymetrix U133A microar- 
ray data set (GEO# GSE3494). This data set was filtered to retain probe 
sets with mean log, expression values > 5.0 for the 247 ER-annotated 
patient samples.'? The ER predictor was based on 1,022 probe sets 
identified by a Wilcoxon rank sum test with Bonferroni correction 
(adjusted P < .05), with intentional exclusion of probe sets for ER 
itself (Data Supplement). The large number of differentially expressed 
genes highlights the distinct biologic characteristics of these different 
tumor types. A predictor of ER pathway status was generated by 
applying the top principal component of expression from the 1,022 
probe set list to a Bayesian probit regression model. Under leave-one- 
out cross validation and a threshold of 0.5, the model classified 91% of 
the ER-negative samples and 85% of the ER-positive samples by IHC 
correctly (Fig 3B). Applied prospectively to the breast cancer replicate 
data set with an optimal threshold of 0.45 (Fig 3C), the model showed 
near perfect precision (ICC, 0.98; 95% CI, 0.95 to 0.99) and 96% 
accuracy identical to the single-gene model. Taken together, these data 
show that when assaying a robust biologic property of breast cancer 
such as ER status, intratumor variance does not preclude precise 
predictions from microarray data. 


Intratumor Heterogeneity Does Not Preclude Precise 
Predictions of Clinical Outcome 

A prognostic model for death attributed to breast cancer was 
generated from expression data from a previously published data set 
(GEO# GSE1456; N = 159).”° Probe sets associated with survival were 
identified using a Cox proportional hazard model and the Benjamini- 
Hochberg method for controlling false discovery rates.”* A total of 205 
probe sets (representing 184 genes) were identified with a false discov- 
ery rate < 0.01. These probe sets contain an over-representation of 
genes involved in cell cycles (20 genes; P < .001), cytokinesis (seven 
genes; P = .004), and cellular metabolism (84 genes; P = .004; Data 
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Supplement). A binary classifier of survival was created using the first 
two principal components. The fitted model accurately stratified pa- 
tients into high and low risk for death from disease with a threshold of 
0.50 (Fig 4A). When this genomic signature was next applied to an 
independent validation data set (GSE3494; N = 315),'° it maintained 
the ability to identify a high-risk cohort for survival (P = .0069; Fig 
4B). The precision of the prognostic genomic signature was found to 
be substantial (ICC, 0.71; 95% CI, 0.48 to 0.87), when applied to the 
replicate samples (Fig 4C). The patient with the most variance in 
recurrence risk predictions (patient F) also showed the poorest inter- 
nal concordance by hierarchical clustering and PCA analysis, suggest- 
ing that the global variance in expression within this tumor may have 
affected the precision of the risk predictor. 

Finally, we established the precision of previously published 
chemotherapeutic sensitivity predictors for doxorubicin and do- 
cetaxel (Fig 5A and Data Supplement), as implemented in a random- 
ized phase II trial to direct neoadjuvant therapy.” A substantial level of 
agreement is observed for the doxorubicin sensitivity predictor (ICC, 
0.72; 95% CI, 0.49 to 0.88). Further, 12 of 18 patients showed complete 
concordance when applying an a priori threshold of 0.6 used in the 
clinical trial. The docetaxel sensitivity predictions showed a slightly 
lower level of precision (ICC, 0.64; 95% CI, 0.38 to 0.83), with 14 of 18 
patients showing complete concordance. Under a multilevel classifi- 
cation of higher sensitivity to one agent or of double resistance (Fig 
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5B), 13 of 18 patients showed complete concordance. By resampling 
from a binomial mixture distribution, the observed agreement in the 
three-level classification was highly significant (P < .001). Variation in 
tumor content and quality control measures for RNA and Affymetrix 
arrays was found not to be associated with the discordance in the 
multigene predictors of clinical outcome (all ANCOVA P > .05). 


Here, we demonstrate that intratumor variance at the level of gene 
expression does not preclude the development of precise microarray- 
based clinical prediction models in breast cancer. We show intertumor 
heterogeneity is greater than intratumor heterogeneity at the level of 
global gene expression for this breast tumor data set. While a small 
group of genes exhibits a significant level of intratumor variation, 
many of these genes are expressed at relatively low levels and can be 
filtered as background noise when creating predictive algorithms. 
Finally, we demonstrate that a high degree of precision was seen 
among replicate samples when assayed using single-gene predictors of 
ER and PR status and PCA-based predictors of ER pathway activation, 
cancer recurrence, and chemotherapeutic sensitivity. 

The heterogeneous nature of solid neoplasms has been recog- 
and studied by pathologists for decades. In the context of breast 
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trogen receptor (ER) and progesterone 
receptor (PR) status demonstrate high 
precision among individual patients and 
agreement with immunohistochemistry. 
(B) A multigene predictor of ER pathway 
activation generated from an independent 
data set of 247 patients (C) demonstrates 
high precision and agreement with immu- 
nohistochemistry in the breast replicate 
data set. Blue solid circles, discordant or 
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Fig 4. Novel multigene signatures for breast cancer survival demonstrate prognostic value by Kaplan-Meier plot and log-rank tests under (A) cross-validation in the 
training data (GSE1456; N = 159), (B) independent validation set in a second data set (GSE3494; N = 315), and (C) substantial agreement when applied to the replicate 
data set. Blue solid circles, discordant or inaccurate samples; gold solid circles, complete concordance. Capital letters are patient designations. HR, hazard ratio. 


cancer, the varying presence of normal breast tissue, inflammatory 
cells, vessels, necrosis, and neoplastic epithelium gives rise to a variably 
mixed population of cells with unique or distinct biologic makeup in 
any given tumor sampling. Thus, the question has been raised of how 
this cellular heterogeneity may affect assays typically performed for 
diagnostic and prognostic purposes in breast cancer patients, includ- 
ing ER, PR, and HER2. 

The largest study of the reproducibility of HER2 testing was 
performed on patients enrolled in the North Central Cancer Treat- 
ment Group (NCCTG) N9831 adjuvant trial of trastuzumab. In this 
study, only 85.8% of the 2,535 patients registered in the trial had 
concordant results for HER2 positivity between the local and central 
performing laboratories (88.1% concordance by fluorescent in situ 
hybridization and 81.6% concordance by IHC).”’ A similar study of 
ER and PR IHC measurements in 776 patients enrolled on Eastern 
Cooperative Oncology Group (ECOG) 2197 showed 90% and 84% 
concordance between local and central laboratory studies for ER and 
PR status, respectively.” ° These studies, however, are strictly a measure 
of assay reproducibility; a measure of technical variance rather than 
tumor heterogeneity. 

Discordance attributable solely to tumor heterogeneity between 
breast core biopsy and resection specimens ranges from 1.2% for ER 
status on the low end”! to 14% for ER status and 20% for PR status on 
the high end.*” Discordance rates attributable to technique and tumor 
heterogeneity that develops over time was determined to be even 
higher (18.4%, 40.3%, and 13.6% for ER, PR, and HER2, respectively) 
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when comparing primary tumors and their corresponding metastatic 
lesions.*? The low discordance rates (0% in this study) of our array- 
based predictors of ER and PR status are at least equivalent to the good 
performance of these traditional techniques. 

While the effect of tumor heterogeneity and technical variance 
on ER, PR, and HER2 testing has been well studied, few studies 
have examined the effect of tumor heterogeneity on multigene pre- 
dictive algorithms. A study comparable to the one presented here 
reached similar conclusions using a 48-gene TaqMan-based assay.** 
This study showed high concordance among three replicate samplings 
for 12 breast cancer patients. A similar study examining the contribu- 
tion of technical variance to the reproducibility of microarray-based 
assays in breast cancer, demonstrated that gene expression—based 
signatures developed from replicate experiments among 35 patients 
resulted in precise predictions of breast cancer chemotherapeu- 
tic responsiveness." 

To the best of our knowledge, our work is the first to demonstrate 
the impact of intratumor heterogeneity on the precision of multigene 
predictive models focused on tumor behavior. Precision is an integral 
component of clinical testing that is often overlooked in the early 
stages of translational research. Accuracy always seems of primary 
interest at that stage. However, our data suggest that a lack of accuracy 
in microarray-based assays may in fact be caused by a lack of precision, 
particularly for more indirect measures of tumor behavior (eg, che- 
mosensitivity or recurrence). For these more abstract measures, a lack 
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of precision may be attributable to true differences in tumor microen- ___ strated in this study. This full genome expression profile may find 


vironment. This is supported by our finding that the genes in these 
complex predictors tend to show higher intratumor variance (Data 
Supplement). In fact, there is a direct correlation between each predic- 
tor’s ICC and the proportion of genes with high intratumor variance. 
Our data also highlight the fact that testing of replicate samples, even 
early in the development of a microarray-based assay, can clearly 
differentiate between inaccurate and imprecise. 

The importance of assessing performance of multigene 
microarray-based assays, as described here, bears on the future use of 
this technology as a single assay for breast cancer patients that can 
provide not only measures of prognosis or predicted therapeutic re- 
sponse, but can also supplement or replace the standard assays of ER, 
PR, and HER2. The feasibility of this approach is, in part, demon- 
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additional uses as assays become algorithms applied to expression data 
sets. As we move toward this, understanding the role of tumor heter- 
ogeneity in measures of tumor behavior and developing approaches 
and data sets (like the one provided here) to test the precision of these 
algorithms is essential. 
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Background: “Performance of Genomic : 
Expression Profiles to Direct the Use of 
Preoperative Chemotherapy for Early Stage iti : . ; 
Breast Cancer” was a prospective randomized Bison ees Le = Standard of Care Regimens = By chemotherapy: 
trial designed to assess in vitro derived ¢ Preoperative chemotherapy is a proven approach that provides an ; . 
signatures for predicting sensitivity to early surrogate endpoint (pCR) for validating predictive markers. * Docetaxel 75mg/m2 and cyclophosphamide 600 mg/ m2 
doxorubicin and docetaxel. Methods: Patients IV every 3 weeks for 4 cycles (TC) 

with HER2 negative early-stage breast greater 
than 1.5 cm were eligible. Either banked or fresh 
cores were then profiled using the Affymetrix 
U133Plus2.0 microarray. Patients with adequate 
microarray data were then randomized in a 2:1 : 
ratio to a genomically-guided arm using the 
microarray data to assign treatment with either 
doxorubicin (AC) or docetaxel (TC) based 
chemotherapy or a control arm using random 
assignment to AC or TC (standard doses every 3 
weeks for 4 cycles). Pathologic complete 
response (pCR) was defined as no residual 
invasive cancer. The primary aims were to 
determine whether genomic profiling could : 
improve pCR rates compared to random 
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Table 2: Response by final pathology 
and/or clinical evaluation 


Overall pCR 


13% (95% CI: 4-27%) 


Treatment Regimens: 


Genomic profiling can provide improved assessment of breast 
cancer biology, potentially providing improved prognostic and 


Toxicities: 


: =" Consistent with those seen with standard dose AC and TC 
« See Figure 1 for protocol schema 


AC: 11 of 19 with grade 3 or 4 adverse events (all 
neutropenia except one treatment unrelated PE) 


¢ — In vitro/cell-line derived predictive markers were developed for 
doxorubicin and docetaxel and evaluated retrospectively. The OR 


translation of in vitro markers successfully could significantly * Doxorubicin 60 mg/m2 and cyclophosphamide 600 mg/m2 
accelerate biomarker development. IV every 3 weeks for 4 cycles (AC) 


« TC: 8 of 20 with grade 3 or 4 adverse events (all 
neutropenia except one docetaxel reaction) 


Control (n=13) 
8% (1/13) 


Genomically-guided (n=26) 


15% (4/26) 


= By arm: P=0.64 


« Genomically-guided: 14 of 26 with grade 3 or 4 adverse 
events (all hematologic except one docetaxel reaction 
and one treatment unrelated PE) 


A prospective marker-based preoperative chemotherapy trial in 
early-stage breast cancer with Affymetrix-based genomic profiling| « 
integral to patient treatment assignment using standard of care 

chemotherapy regimens was designed to validate these markers. 


This analysis includes all randomized and treated patients. 
Following completion of assigned treatment, patients 
were Clinically re- assessed for response per standard of 
care with team and patient decision regarding whether 
to proceed to surgery or additional chemotherapy. 


Table 3: Response by final pathology only 


= Control: 5 of 13 with grade 3 or 4 adverse events (all 
hematologic) 


* Subsequent review of the foundational work deriving the markers 
and the retrospective validations showed this work to be flawed, . 
and it has subsequently been retracted. This validation trial was 
therefore no longer justified and was closed to accrual. 


The analysis below excludes four TC treated patients; 3 did 
not have definitive surgery or biopsy after completing 
protocol defined treatment and one had a docetaxel reaction 


and was treated with AC. 
AC Treated Cases OM Nero Ore 


No treatment related deaths 
Primary Aim: 
To determine in early stage breast cancer treated with primary 


systemic therapy whether genomic profiling for drug-sensitivity 
can improve the pathologic complete response rate as compared 


Patients were unblinded for arm assignment and . 
doxorubicin/docetaxel prediction scores after completing 
therapy. 


An underpowered analysis of the primary endpoint in the enrolled| «= 
patients was undertaken as part of the Data Safety and Monitoring 


Nodal assessment was performed pre —chemotherapy per 
institutional standard, although not required per protocol 


assignment; and to determine whether profiling Board review and to address concerns that the flawed biomarkers (i.e. could also be done at definitive surgery) . f pati h pCR pCR 

could identify drug-sensitive and drug-resistant might have resulted in inappropriate treatment assignments. eee ede o nan On baum romel any: 2 Re Generic ion execu Gree 
patients by comparing subgroups for AC and TC. (pCR 26% in genomically-guided v. 13% in control, 82% power) Prediction Prediction 

Results: The paper describing the : Rovian es 1 9 
chemosensitivity signatures has been withdrawn Table 1: Enrollment Demographics Figure 2: Pathologic Complete Responses (pCR) ee ee el Sere rene art a 
for reproducibility and data integrity problems. (Consented and Randomized, July 2008 to July 2010) i ; Sensitive 6 2 8 Sane 

Consequently the scientific justification for the J in Breast by treatment and arm (All Patients) Total 1S) lal 16 


Total 15 4 19 
Fisher’s exact p>0.99 


trial no longer exists and the trial was 
permanently closed on 11/1/2010. 39 patients 
were randomized. The overall pCR rate was 13% 
(95% Cl, 4%-27%). The pCR rates in the 


CONSENTED= 56 —+ 47 Screen Failures* 


Fisher’s exact p>0.99 


A Determination of eligibility, 
| Tissue acquisition and 
| genomic profiling 


Eligibility Criteria: 


genomically guided v. randomized arms were 15% ¢ Chemotherapy deemed appropriate by treating oncology team. en ea Adriamycin gj Docetaxel Swatitcaion Factor 
(4/26) and 8% (1/13), respectively (p=0.64). The No restriction on using other tests to assess chemotherapy 40-49 * Withdrew _ Screen Failure P n=180 =26 ee + Tumor size: Over/Under 4 cm 
analysis by individual regimens showed: AC indication (i.e. Oncotype allowed). aes sue ey ee wi aires ee | _ won Sonoma Said This underpowered analysis does not suggest any difference in pathologic 
overall pCR rate 21% (95% Cl, 6%-46%), predicted ¢ Tumor size over 1.5 cm by clinical assessment 7079 Se Sac planar Failure t “cau” || wae” || acmnmiecs mm | 2:1 Randomization | mmm n=90 =13 complete response rates between the investigational marker guided arm 
sensitive 25% v. predicted resistant 18% (p=1.0); a — a nena : J 4 y RANDOMIZED= 39 y and the control arm, or by prediction of resistance/sensitivity for either 
TC overall 5% (95% Cl, 0-25%), predicted sensitive ¢ Any ER/PR status allowed; HER2 negative by standard clinical Black 1 Inadequate RNA ‘ ve Bh [Randomize eae regimen. Given the conservative standard of care design of the study, 
0% v. predicted resistant 9% (p=1.0). Conclusion: assessment. Agere i =n ae) 19 AC; 20 TC there is no evidence that patients were harmed by participation. The 
This underpowered analysis was conducted on * No T4 lesions s a Yay J \ genomic data generated is of high-quality and performed on a standard 
the available data to examine the possibility of an ( oe oe ) AC Js . Affymetrix array. It will be made publically available for interested 
adverse trend in patient outcomes. No evidence * No contralateral cancer. - | (27 Fé =6 = | investigators pending completion of analyses and review. 

i i : : : ian tumor size is 3.2 cm —_—— = 
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this trial can inform the design of future integral 
biomarker-driven trials, particularly in the 
preoperative setting. 


multicentric disease. 


(range 1.6 cm to 7.5cm) 


ER positive 
ER negative 
ER unknown 


Surgical Management or 
Additional Therapy 
(Assessment of Pathologic CR) 
Unbinding of Treatment Assignment 


One patient randomized and assigned to genomically guided AC 
ineligible due to multicentric disease and treated off protocol 


‘ if chemo sensitivity to both agents is over 60%, 
then assign to highest 


Four patients have developed metastatic disease 


Two have died from recurrent disease; both had high-grade “triple- 
negative” disease 
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RESULTS: | Figure 5: Profiling timelines 


Background: Genomic assays have been shown Molecular assays hold promise for personalization of : J : : : ase ; Saas 
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data can also be used for discovery of new phenotypic signatures. 
U133Plus2.0 arrays and principal component chemosensitivity signatures (Publication now retracted, Ref. 3) , a entae 7Ah0 Direwt we use ur rreuperauve >: Whew ea 
analysis (PCA) plots. Results: 57 patients were Figure 2: Trial Design r Early Stage Breast Cancer 3 Declined repeat biopsy 3 multicentric Statistical methodologies for applying in vitro derived genomic 
consented and screened; 48 were eligible and had Air : : giotmingion et eligibiliy. 1 Oncotype=11 3 inelig (PS>2; Ift's; other ca) : é eer 
Fe SUS ERGHIRG REG. heey ee Clinical Trial Desi gn: rma satan ty ‘Insurance concerns 1 Hybridization Failure signatures to “one at a time’ clinical samples were developed and 
was successful on 41 tumors (85%), providing nT white ‘Nokwesive canoe ee eee 
data of sufficient quality to make treatment * Marker-guided design to test feasibility and utility of genomic mer aes ee Pe Hispanic 1 Inadequate RNA — a4 — a 
assignments. Median tumor size was 3.2 cm information for guiding chemotherapy selection. ae || cme, | oath | a Nesters HOWEVER, the scientific work describing and validating the in vitro 
(range, 1.6-7.5). One sample hybridization failed pyre ot geneity fi genety es 2:1 Randomization = om gut mae Median tumor size is 3.2 cm derived signatures was flawed and the supporting publications have 
QC as detected by PCA plots, two samples had ; ; _ ¥y ¥y been retracted. Since this work provided the justification for 
insufficient RNA, and four samples had ¢ Preoperative treatment to provide short-term clinically relevant andomize andomize ae (range 1.6 cm to 7.5cm) conducting the trial, this study has now been closed. 


endpoint 


Lobular 


Other 


ER positive 
ER negative 
ER unknown 1. Potti A. et al: Genomic signatures to guide the use of chemotherapeutics. Nat Med. 12:1294 1300, 2006 


inadequate tissue. The median “study consent to 
treatment” and “tissue to treatment” times were 

16 days (range, 10-54) and 13 days (range, 10-29), 
respectively. Conclusion: Real-time full-genome 


expression analysis on frozen tumor using an = : aye i 
Affymetrix platform can be feasibly incorporated Surgical Management or | Figure 4: Profiling metrics 


into an integral biomarker trial design, even in a * Choose a conservative clinical question, with importance, but aaron bana "57 Patients "52 Patients 
eal . . . . = ge . Unbinding of Treatment Assignment 
preoperative setting. The data can potentially he reasonable clinical equipoise. For this trial, initial preoperative poesia . : eae —— ieee —— 
: treatment with an anthracycline vs. a taxane based regimen. ae g 
analyzed for a number of prognostic and y g 1. To determine in early stage breast cancer treated with Sector A screen 3 
predictive signatures primary systemic therapy whether genomic profiling for drug- Saieeee a tissue not profiling 


peaks é ns profiled 4. West M et al: Predicting the clinical status of human breast cancer by using gene expression profiles. 
sensitivity can improve the pathologic complete response rate as ial iii cen enti ee aie 
compared to random assignment of patients to therapy. 


Maen nese : — Cohort ie. - 
* Genomic information integral to trial conduct SA RB 


| (This publication has been retracted per discussion above) 


2. Bild AH et al. Oncogenic pathway signatures in human cancers as a guide to targeted therapies. Nature. 
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3. Bonnefoi, H. et.al., Validation of gene signatures that predict the response of breast cancer to neoadjuvant 
chemotherapy: a substudy of the EORTC 10994/BIGO0-01 clinical trial. Lancet Oncology, 8: 1071-1078, 
2007. (This publication has been retracted) 


¢ Patient Safety: 


5. Barry W. et. al: Utilization of genomic signatures for chemotherapy response in prospective clinical 


¢ Treatment assignments that are “Standard of Care” studies, ASCO Meeting Abstracts Jun 14, 2010: 10513 


(pCR 26% in genomically-guided v. 13% in control, 82% power) 
¢ Flexibility in patient management 


6. Barry WT et al: Intra-tumor heterogeneity and precision of microarray-based predictors of breast cancer 
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e Microarray assessment of breast cancer demonstrates disease subsets miellas 1 Distribution of OncotypeDx Recurrence Figure ome Lor- 11 (=) a od (0) | Folded Geen) am nX=)i-1k-1ela(-\-3-mej}mmm © Even in ER+ breast cancer patients, OncotypeDX provides limited 
among the major breast cancer biologic categories (ER, PR, HER-2). S (RS) the different Microarray-based Breast Prognostic treatment guidance for intermediate risk patients. Large proportions of 
cores 
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breast cancer patients are currently classified at intermediate risk per 
WV Foxe 21k OncotypeDX recurrence score. Additional prognostic / predictive 
biomarkers could be informative for improved personalized chemotherapy 
and treatment decisions. 


e These have likely additional prognostic and treatment (predictive) 
implications. 


e Additional prognostic and predictive biomarkers and optimization of existing Wel We llste> GINS eh Gl elle) HE Irelni eUteNN) Gi 


genomic platforms are needed to improve personalized breast cancer care. ¢ Previously reported microarray-based surrogate Oncotype signatures are 


generally concordant. However, they have suboptimal correlation with the 
true Oncotype score. They are limited by sample size and the availability of 
true OncotypeDX scores. 
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e The objective of our study in ER+ breast cancer patients Is: 


10 15 20 25 30 35 


Affy. model (H-K) 


Affy. model (Fan) 


e Our findings are from largely ER+/HER- patients with available 
OncotypeDX score, but are still limited by the cohort size. Further 
assessment, validation, and reproducibility of our findings are needed prior 
to clinical use. 


o To perform a cross-platform comparison of existing prognostic genomic 
signatures and the true OncotypeDx recurrence score 


5 


0 


o To perform discovery of additional prognostic / predictive genes or 
pathways in ER+ breast cancer patients 


26-30 31-100 
(IA) (AA) 


Affy. model (H-K) Oncotype DX score Oncotype DX score 


e Figure 4: Two existing Affymetrix-based microarray models for Oncotype 
OncotypeDX Score score from Fan/Perou and Haibe-Kains differ in how they pick the 
a aye ceatlanira Rieter Figure 2: General association between the microarray oncotype genes Affymetrix probe sets to represent each oncotype gene, and also differ in 
ceed pis and true OncotypeDX score was highly significant by SAFE (Barry et al). their respective algorithms. 
e Selection of early-stage ER+ patients with adequate RNA from fresh frozen ; eee inthe GAEE inlet the direct Pinecni diff tial f 
tumor specimens and a concurrent 21-gene OncotypeDX recurrence score (RS) ° Figure 1: Distribution of OncotypeDX scores for our ER+ breast cancer nme prot, tne direction oF ne microarray airerential expression OF . jlevertheless, output from these two microarray-oncotype signatures are 


were identified across independent Duke studies linked to routine prospective patients by low (L), intermediate (I), and high (H) risk OncotypeDX oncotype probe sets largely agrees with the model reported for the highly correlated with each other (left panel, rho=0.93). 


breast biospecimen collection. categories per Paik et al (intermediate scores 18-30) and per TAILORx Oncotype DX assay - with exceptions likely attributable to our cohort . . a . . 
trial (intermediate scores 11-25). being predominantly ER+/HER2-. e Despite the highly significant correlation of the microarray gene sets and 


Oncotype scores, agreement of both microarray-oncotype signatures 


oe ; with true Oncotype score is relatively poor with a prominent upward-shift 
Table 2 Association of key oda tetolel-aile: Pathway- in both microarray models (middle and right panels). e Figure 6: The following “scatter plot matrix” displays the relatedness of 


Signatu res to Oncotype AS Yoxe) X=) the various microarray models of breast cancer prognosis. 
e Microarray models show good correlation among each other for ER+ In ER+ breast cancer, we have identified 


Doe wo patients, with the exception of the GENIUS model. additional key cancer-associated genes, 
0.0012 0.0129 pathways, and genomic signatures 
associated with breast cancer prognosis, 


Local statistics 
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Sample Selection and Patient Characteristics 


e Previously reported microarray-based 
surrogate Oncotype signatures are limited 
by sample size and the availability of the 
actual OncotypeDX recurrence _ scores, 
which are necessary for a_ precise 
microarray-based Oncotype signature 
development. 
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rho = 
0.427 


e After quality control of the microarray data, and normalization for technical 
batches in the laboratory procedures, data are available for a total of 104 ER+ 
patient samples annotated with Oncotype RS and other patient and tumor [M)e)(-)9 Discovery rey i Prognostic CTV alike rele rele 


characteristics. 
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e Originally 73 patient samples were eligible. Since abstract submission an 


or : <s : . Fold change | Adjusted 
additional 31 ER+ breast cancer patients were found eligible and are included in |i" -stymevno | Gounishs| ve | cene | rulcenetone Gene Functions 
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this expanded analysis. ; ? a Proteins may be RNA editing enzymes and have 
Apolipoprotein B mRNA editing : 
: ere roles in cell growth or cell cycle control and 
enzyme, catalytic polypeptide-like 3B. 
immune response. 
Signal peptide, CUB and EGF-like Suspected breast cancer suppressor protein in 
domain-containing protein 2 part by suppression of B-catenin pathway 
oO 104 ER-positive early-stage breast cancer patients Peren 0.00033 cree, CCCTC-binding factor (zinc finger __ Involved in gene regulation, utilizes different zinc 
2 protein)-like fingers to bind varying DNA target sites. 
Role in regulating the early transport of insulin 


O Median age 55 (range 35 = 86) 204002_s at : 0.00041 ICA1 Diabetes mellitus type | autoantigen Le Ee On Sranuleioronein® 


211211 x at 0.00041 SH2D1A T cell signal transduction molecule Protein that plays a major role in the bidirectional 
Ss stimulation of T and B cells 
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e Patient Characteristics: 
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true Oncotype Score: recurrence score (RS) 
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0.0112 0.0766 
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potentially providing insights into treatment 
opportunities for ER+ breast cancer. 


0.0174 0.0766 
o Postmenopausal 63% 


: Bilaeenneype aie Cell adhesion ligand for skin epithelial cells and ; ; ; ; . F = iS ; F 
sy ee acieni 79h. Aiacanmnenican i: 242326 at 0.00082 COL22A1 Collagen, type XXII, alpha 1 1 ectess, ween * Table 2: For association with prognostic score, two oncogenic pathways RANI FOL) tu risk-of-relapse and Soe ele Seals Petisen eles et al) Training Set ew 

D t A ' ; 225728_at A 0.00096 SORBS2 Arg binding protein 2 eee coe eee ae pass a FDR of 5% (light blue), and three additional pathways pass a less 70-gene STIMU 1h \WFevaalaatsleaatayt AVETa t Veer, ATs lm fellelsw aesllarss et al) Further validation of these additional 

ate Aialys!= 207746_at 0.00096 POLQ Polymerase (DNA directed), theta Pa ioe craic ei ang stringent FDR threshold of 10% (darker blue). GENIUS (Haibe-Kains/Quackenbush et al) : 

e Expression estimates for Affymetrix H133 Plus 2.0 microarrays were reviewed for 219836_at . 0.00096 ZBED2. i nner BED domaln-containin unknown function prog nostic genes and pathways hold the 

. . . Possesses both activator and repressor cell cycle = = = 
ie seid (Owzar et al.) and normalized across batches with ComBat oe | Oe el ea vy mylobists tl oncogene ais tats the cl ison ile 2, yh 5 promise to improve personalized breast 
ohnson et al). ce @ = : ; 
228554 at : 0.00172 PGR Progesterone receptor eens physiological effects of progesterone e a = 7 a a e ) = . - r) = 8 3 cancer care In ER+ breast cancer patients, 

7 The association between mRNA expression by Affymetrix to OncotpeDx test Was 208079 _s at 0.00303. AURKA Aurora kinase A Siesta Cun Ne arla eam tumor 3 = r [| 1 i t d 1 t i k O t DX 

investigated at the feature-level using linear models for microarrays (LIMMA, a | | development and progression. — e gg Sie ic 3 especially in intermediate Fis ncotype 
omologous recombination pathway for double- . 
Smyth et al.) and the Benjamini-Hochberg method was employed to account for pve Boch gael bese eu else cence ray ene Geneon wrens eile 3 7 patients. 
multiple testing. 212021 s_at 0.00611 MKI67 fa nas ees le cellarotfercan A C EB 
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e Affymetrix-based models of prognostic breast cancer signatures, including 21- RESeRSSS AIG STANGER ‘ ‘ S 2 Cohort(s) with concurrent OncotypeDX 
gene model in Oncotype, the 70-gene model in MammaPrint, the 50-gene model 202095 s at 5 0.00799 ~=BIRCS IAP repeat containing 5 Proteins prevent apoptotic cell death z o score and recurrence-free survival data are 
in PAMS0, and the GENIUS model were computed using algorithms from Fan et A oie True RS True RS ‘ . . ne 
al and Haibe-Kains et al. Association of microarray model scores to true e The top 194 Affymetrix probe sets from the Affymetrix platform had a ce om J : required to prove the clinical validity and 
Oncotype recurrence score was assessed using Spearman rank correlation, and Pie Oncotype DX score Oncotype DX score Oncotype DX score ae . 2 . 

Lar 7 echeh cae | FDR <1%. : a utility of new prognostic biomarkers in ER+ 
gene sets to the models were tested using SAFE (Barry et al). ee a | aoa 
e Shown are (a) all genes with adjusted p < 0.001, (b) genes from early stage breast cancer patients. 
e 22 key pathway-signatures are built into the ScoreSignature module (Chang et (a) g ae sd * (0) g see y g p 
OncotypeDX that reached statistical significance, and (c) other relevant ai | : ; 

al) hosted on the Duke Institute for Genome Sciences and Policy (IGSP) - hecean oe , 7 ; e Figure 7: A shrinkage-based regression approach was used to build a References: 
instance of GenePattern. The association of predicted probabilities from each genes linked to breast cancer that were statistically significant. mie | ; | new microarray model for the true OncotypeDX recurrence score. based (1) Barry. W.T., Ann Appl Stat. 2008; 286-315. (2) Smyth G.K., Stat App! Genet Mol Biol. 2004; 3:Article3. (3) Johnson 
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Background: Numerous gene-expression signatures have been developed for breast 
cancer. However, assessments of their validity continue to largely ignore the impact of 
intratumor heterogeneity and technical variation on clinical utility. Here, the collection of 
replicate specimen in Barry et al (2010) is used to evaluate a broad collection of gene- 
expression signatures of prognosis and tumor biology. 


Methods: 18 patients with multiple frozen cores (one patient with quadruplicate, twelve 
with triplicate, and five with doublet samples) were previously identified in the Duke 
Breast SPORE tissue repository. Cores were assessed for percent invasive cancer 
cellularity, and tumor size, grade, and ER/PR status. RNA was extracted and hybridized 
to AffymetrixH133Plus2.0 microarrays. Expression signatures of prognosis and tumor 
biology were developed or translated to the Affymetrix platform using routines by Prat et 
al. (2012) and Haibe-Kains et al. (2012). Association between signatures, and precision 
among replicates are evaluated using Pearson and intraclass correlation. Coefficients are 
reported with 95% confidence intervals. 


Results: Among prognostic signatures, an imputed 70-gene signature of MammaPrint 
had the highest level of precision (ICC=0.96, 0.91-0.98); strong concordance is seen with 
Affymetrix-based PAM50 risk-of-relapse (ICC=0.82, 0.65-0.92); 16 of 18 patients had 
constant subtype calls. Substantially lower concordance was seen in the GENIUS 
prognostic model (ICC=0.62, 0.35-0.82). In ER+ patients (n=13), two algorithms to 
impute the 21-gene model of OncotypeDX recurrence score were concordant (r =0.98, 
ICC= 0.90 and 0.83) and distinct from the GENIUS ER+ score (average r=0.74, 
ICC=0.78). Imputed models for the Rotterdam 76-gene model (+ER status), GGl 
(+grade), and ROR-T (+tumor size) showed lower levels of correlation (0.47<rho<0.64), 
suggesting independent prognostic information is conveyed by the unique combinations 
of clinical and genomic factors. High correlation is seen between models of ER status 
ranging from single-gene data (ICC=0.90) to signatures from archived specimen 
(ICC=0.96) and cell lines (I\CC=0.85), with almost complete agreement to IHC results. 
Two signatures of PI3K from specimen with mutation data were partially correlated and 
concordant (r=0.52, ICC= 0.96 and 0.91), while two signatures from overexpression 
systems were uncorrelated, and one with poor precision (r=0.12, |CC= 0.92 and 0.46). 


Conclusions: The number of genomic signatures in breast cancer from archived 
specimen and cell-line experiments continues to grow, but there are limited resources for 
validating their prognostic/predictive value in patient populations. Reproducibility across 
biological replicates is a critical component in establishing clinical utility of a signature that 
is distinct from using technical replicates for the repeatability of analytes on the array 
platform. We demonstrate how archived specimen can confirm reproducibility in the ‘Test 
Validation Phase’ of biomarker development, as advocated by the Institute of Medicine, 
and inform trial designs to prospectively test clinical utility. 


Update: Additional pathway signatures of PR, 


Number 
p53 and EGFR are reported. Some of the findings 
are expanded to a second cohort of 70 replicate pl 


samples from 24 HER2- patients. Replicate pre- 
and post- treatment cores available from the 
clinical trial “Utilization of Genomic Signatures to 
Direct Use of Primary Chemotherapy in Early 
Stage Breast Cancer” (NCT00636441) were 
analyzed using the same procedures employed 


Total 
cellularity on the precision and accuracy of samples 


for Cohort 1 samples. The impact of tumor 
signatures is calculated using standard deviation . 


TABLE 1: AGREEMENT WITHIN COHORT 1 


PREDICTIVE OF TUMOR BIOLOGY 


Signature/algorithm sources: 
¢ R/Bioconducor package, genefu, by Haibe-Kains et al.: 


70-gene (cf. Mammoprint); 16-gene (cf. Oncotype); 76-gene (cf. 
Rotterdam); GGI; PIK3CA; GENIUS 
¢ R source code from the Perou laboratory (http://peroulab.med.unc.edu/): 


21-gene (cf. Oncotype); 50-gene PAM50 intrinsic subtype and risk-of- 
relapse (ROR); PI3Kwt and PI3Kmut (Hutti). 


*GenePattern module, ScoreSignature, by Gatza/Chang for genomic 
signature of oncogenic pathways (OncPath) from a variety of sources: 


AKT, Beta-catenin, E2F1, EGFR, ER, HER2, IFNa, IFNy, MYC, p53, 
PI3K, PgR, SRC, STAT3, TGF, p63, RAS, TNFa 
« R source code from Barry et al. (2010, in prep.): ER, p53 (Miller, 2005) 
* Single-probesets identified by Gong et al. (2006). 
ICC (95% Cl) 
PROGNOSTIC 
70-gene (Mammoprint) 
PAM50 ROR 
GENIUS 


21-gene and 16-gene (Oncotype DX) 
in ER+ patients. 


0.96 (0.91, 0.98) 
0.82 (0.65, 0.92) 
0.57 (0.29, 0.79) 


0.90 (0.77, 0.97) 
0.83 (0.64, 0.94) 


GGI, 76-gene (Rottingham); ROR-T -NA- 
ER: IHC in archived spec. (Barry, GSE3494) 0.97 (0.93, 0.99) 
0.93 (0.86, 0.97) 
0.85 (0.70, 0.94) 
0.90 (0.79, 0.96) 
0.90 (0.80, 0.96) 
18 of 18 concordant 
0.92 (0.84, 0.97) 
0.95 (0.89, 0.98) 
0.91 (0.82, 0.96) 
0.90 (0.80, 0.96) 
0.46 (0.16, 0.73) 
0.88 (0.79, 0.95) 
0.44 (0.14, 0.71) 


ER: single-probeset (Gong) 

ER: in vitro cell-lines (Gatza) 

PR: single-probeset (new) 

PR: mut in archived spec. (Gatza, GSE3494) 
p53: mut in archived spec. (Miller, GSE3494) 
p53: mut in archived spec. (Gatza, GSE3494) 
PI3K: PIK3CA mut. in archived spec. (Loi) 
PI3K: in vitro overexpression of mut. (Hutti) 
PI3K: in vitro overexpression of wt. (Hutti) 
PI3K: in vitro overexpression (Gatza) 
EGFR: single-probeset (new) 


EGFR: in vitro overexpression (Gatza) 


OncPath 
(>0.14, opt.) 


Single gene 
(>9.6, pre-spec.) 


Archived spec. 
(>0.63, opt.) 


Single gene OncPath Single gene OncPath 
(>4.0, opt.) (>0.19, opt.) (>8.5, opt.) (>0.5) 
: -/+ + 
om) 1 || 0 
1 | 1) 10 


As reported in Barry 2010, very strong agreement is seen with IHC, and few _- For the progesterone receptor, the gene signature 
from archived specimen (Gatza) outperforms any 
single-gene measurement on the Affymetrix array 
that is optimized for precision/accuracy. 


discordant calls (labeled ‘-/+’) are observed among replicates when using 
either a pre-specified threshold (Gong et al), or cut-points to continuous 
scores which maximize concordance among replicates. 


Conversely, for EGFR, a single-gene measure closely 
correlates with IHC and has good concordance (15 
out of 18); while the OncPath signature from an 
overexpression system performs poorly. 


FIGURE 2: SCATTERPLOTS OF AGREEMENT WITHIN PROGNOSTIC SIGNATURES 


In Cohort 1 (n = 18 patients), the genomic prognostic signatures (a) 70-gene, (b) PAM50- 
ROR, and (c) GENIUS show varying levels of agreement among replicates (Table 1). In 
each case, the five patients that were ER- by IHC were classified as having worse 
prognosis on average 


in the replicate 
Shown as_ horizontal 
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There is of minimal overlap in genes and clinical factors among the 4 signatures (9 genes 
in 3 or more). In spite of this, strong positive correlation is seen in Cohort 1. 
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Two multi-gene signatures of p53 mutation status were separately developed using the 
same set of archived specimen. Despite minimal overlap of gene-sets and different set 
sizes in each model, strong agreement and precision is seen in predictions. 
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Among four PI3K genomic signatures: (a) Strong agreement is seen in predictions based 
on the two signatures from Hutti, despite distinct forms of PI3K being transfected. 
Moderate agreement is seen with the PIK3CA signature derived from archived specimen 
despite only 4+2 overlapping genes. The OncPath signature by Gatza et al. has no 
agreement with the other signatures, and poor precision among replicates (Table 1.) 


FIGURE 5: IMPACT OF TUMOR CELLULARITY ON PRECISION AND ACCURACY 
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Combining Cohorts 1 and 2, tumor cellularity assessed by histopathology ranges from 0 — 
90%. Impact of cellularity on precision of signatures was measured by standard deviation 

(SD). The panels below for two ER signatures are consistent with other signatures. Also, 

cellularity is also shown decrease accuracy relative to ER status by IHC pretreatment. 
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For 70-gene, 50-gene and 21-gene prognostic signatures, all lose precision when (a) 
average cellularity is low, and (b) variability in cellularity is high. Further, for the PAM50 


assay, several patient samples with low cellularity are flagged as ‘Normal-like’ (red sq.) 
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The clinical utility of any genomic signature requires a strong degree of 
precision across biological replicates of tissue/blood sample. However, 
this remains underexplored in most retrospective and prospective 
validation studies. Here we use replicates from an initial cohort of 18, 
and expanded cohort of 24 patients to evaluate a series of genomic 
signatures for breast cancer that have been made publically available. 


= Affymetrix-versions of the 70-gene (Mammoprint) and 50-gene (PAM50) 
show substantial agreement by intraclass correlation, and consistently 
segregate ER+ tumors into low vs. intermediate risk. 


= Less precision is seen with the GENIUS model. 


= Prognostic models that combine genomic and clinical information are highly 
positively correlated with one another, despite largely non-overlapping factors 
to the models. 


= Less correlation is seen among independently derived signatures of PI3K 


= In general, tumor biology signatures derived from candidate genes or 
profiling archived tumors show greater precision than signatures derived from 
in vitro over-expression experiment. 


= Cellularity is shown to impact precision of signatures of tumor biology, with 
notable increases in variance in some samples with <45% cellularity. This is 
further shown to increase deviation from the pre-treatment ER status by IHC. 


" Cellularity impacts precision of prognostic signatures to a similar degree. A 
larger cohorts with outcome information and varying cellularity would be 
needed to evaluate the extent to which prognostic value is lost. 
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